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Abstract 

Linear extended top-down tree transducers (or synchronous tree-substitution grammars) are popular formal 
models of tree transformations. The expressive power of compositions of such transducers with and without 
regular look-ahead is investigated. In particular, the restrictions of nondeletion, e-freeness, and strictness 
are considered. The composition hierarchy turns out to be finite for all e-free (all rules consume input) 
variants of these transducers except for nondeleting e-free linear extended top-down tree transducers. The 
least number of transducers needed for the full expressive power of arbitrary compositions is presented. 
In all remaining cases (incl. nondeleting e-free linear extended top-down tree transducers) the composition 
hierarchy does not collapse. 
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1. Introduction 

The top-down tree transducer is a simple formal model that encodes a tree transformation (i.e., a relation 
on trees). It was introduced in (22l . |23| and intensively studied thereafter (see fl3l - f]~5l | for an overview). 
Roughly speaking, a top-down tree transducer processes the input tree symbol-by-symbol and specifies in 
its rules, how to translate an input symbol into an output tree fragment together with instructions on how 
to process the subtrees of the input symbol. This asymmetry between input (single symbol) and output 
(tree fragment) was removed in extended top-down tree transducers (xt), which were introduced and studied 
m (II- 01- In an xt the left-hand side of a rule now contains an input tree fragment, in which each variable can 
occur at most once as a placeholder for a subtree. In particular, the input tree fragment can even be just a 
variable, which matches every tree, and such rules are called e-rules. In this contribution we consider linear 
xt (1-xt), in which the right-hand side of each rule contains each variable at most once as well. Restricted 



variants of 1-xt are used in most approaches to syntax-based machine translation [171 Il8l | . 

We also add regular look-ahead |8fl (i.e., the ability to check a regular property for the subtrees in an input 
tree fragment) to 1-xt, so our most expressive model is the linear extended top-down tree transducer with 
regular look-ahead (l-xt R ). Contrary to most of the literature we present our model as a synchronized 

grammar |5j because we sometimes use the auxiliary link structure in our proofs. Instead of variables in 
the left-hand side and a state-variable combination in the right-hand side of a rule, we immediately only 
use states with the restriction that each state can occur at most once in the left-hand side and at most 
once in the right-hand side. Moreover, all states that occur in the right-hand side must also occur in the 
left-hand side. In this way, for each rule the states establish implicit links (a state links its occurrence in 
the left-hand side with its occurrence in the right-hand side) , which form a bijection between a subset of the 
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state occurrences in the left-hand side and all state occurrences in the right-hand side. The state occurrences 
(in the left-hand side) that do not participate in the bijection (i.e.. those states that exclusively occur in the 
left-hand side) can restrict the acceptable subtrees at their position with the help of regular look-ahead Q. 
The implicit links in a rule are made explicit in a derivation, and a rule application expands (explicitly) 
linked state occurrences at the same time. Example [5] shows an l-xt R , for which we illustrate a few derivation 
steps in Figure [21 The tree transformation computed by the example l-xt R is shown in Example [5J In the 
following, we use 1-XT R and 1-XT to denote the class of all tree transformations computed by l-xt R and 1-xt, 
respectively. 

The expressive power of the various subclasses of 1-XT R is already well understood (l2l fltj ]. However, 
in practice complex systems are often specified with the help of compositions of tree transformations [2~0 | 
because it is much easier to develop (or train) small components that manage a part of the overall transfor- 
mation. Consequently, J 18] and others declare that closure under composition is a very desirable property 
for classes of tree transformations (especially in the area of natural language processing). If a class C of 
tree transformations is closed under composition, then any composition chain r% ; ■ • • ; r„ of tree transfor- 
mations n, . . . ,T n of C can be replaced by a single tree transformation r G C. If C represents the class of 
all tree transformations computable by a device, then closure under composition means that we can replace 
any composition chain specified by several devices by just a single device, which enables an efficient modular 
development. Unfortunately, neither 1-XT R nor 1-XT are closed under composition pi. I3I Il6| . 

In general, for a class C of tree transformations (that contains the identity transformation) we obtain a 
composition hierarchy C C C 2 C C 3 C • • • , where C n denotes the n-fold composition of C. The class C might 
be closed under composition at power n (i.e., C n — C n+1 ) or its composition hierarchy might be infinite (i.e., 
C n C C n+1 for all n). In the former case, we say that the composition hierarchy of C collapses at power n, 
which also yields that C n — C m for all m > n. In particular, C is closed under composition if its composition 
hierarchy collapses at power 1. We note that in practice (e.g., in machine translation) the classes that are 
closed under composition at a small finite power are also important because for such classes we can limit 
the length of composition chains (2l| . In this contribution, we investigate the composition hierarchy of the 
classes 1-XT R and 1-XT together with their subclasses determined by the properties: e-freeness, strictness, 
and nondeletion, which are abbreviated by 'e", 's', and 'n', respectively. Roughly speaking, e-freeness yields 
that all rules are e-free, strictness guarantees that the right-hand side of each rule contains an output symbol, 
and nondeletion requires that for each rule exactly the same states occur in the left- and right-hand side. 
We use the property abbreviations in front of 1-XT R and 1-XT to obtain the class of all tree transformations 
computable by such restricted l-xt R and 1-xt, respectively. For instance, esl-XT R denotes the class of all 
tree transformations computed by e-free and strict l-xt R . 

It is known that none of our considered classes is closed under composition 0, Section 3.4]. In addition, 
it is known that esnl-XT = e'snl-XT 11 is closed at power 2 [6, Section II-2-2-3-3]. We complete the picture 
as follows. For each of the remaining classes, we either provide the least power at which the class is closed 
under composition or show that the composition hierarchy of the class is infinite (denoted by 00). Our 
results (together with the mentioned existing result) are presented in Table [1] 

Our contribution is organized as follows. Section [5] recalls the necessary concepts and introduces our 
notation. We continue in Section [3] with the formal introduction of our main model (l-xt R ) including its 
syntax and semantics and the restrictions that we consider later. In addition, we recall some known equalities 
between certain fundamental classes of tree transformations in preparation for our first main results. In 
Section@]we give a power at which the classes esl-XT, esl-XT R , el-XT, and el-XT R of tree transformations 
are closed under composition (cf. Table [IJ . This is completed in Section [5J where we conclude that the 
presented powers (Table [IJ are minimal. Finally, in Section [S] we prove that the composition hierarchy of 
the remaining classes is infinite. 

2. Notation 

We denote the set of all nonnegative integers by N. The set of all finite words (finite sequences) over a 
set S is S* = UngN wnere S° = { £ } contains only the empty word e. The length of a word w £ S* is the 
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Table 1: Characterization of the composition hierarchies. 



unique n G N such that w G S n . We write \w\ for the length of w. The concatenation of two words u, to G S* 
is denoted by u.iu or simply vw. 

Every subset of S X T is a relation from 5 to T. Given relations i?i C S x T and i? 2 G T x J7, the 
inverse of i?i is the relation i?^ 1 = {(i, s) | (s, i) G Ri}, and the composition of i?i and i?2 is the relation 

Ri;R 2 = {{s,u) \3teT: (s,t) G Ri,(t,u) G i? 2 } • 

These notions and notations are lifted to classes Ci and C2 of relations in the usual manner. Namely, we let 
cr 1 = {Ri 1 1 #1 e c i} and 

Ci ; C2 = ; i?2 I Ri G Ci, i?2 G C2} . 

Moreover, the powers of a class C are defined by C 1 = C and C ,l+1 = C™ ; C for n > 1. The composition 
hierarchy (resp. composition closure) of C is the family (C n | n > 1) (resp. the class [J n>1 C"). If C ,l+1 = C n , 
then C is closed under composition at power n. For n — 1 we shorten this to just C is closed under 
composition. If C is closed under composition at power n, then Ui<i<n^ 1 ^ s ^ ne composition closure of C. 
Moreover, we note that if C contains the identity relations, then C™ C C n+1 for all n > 1, so that in this case 
C n is the composition closure of C provided that C is closed under composition at power n. Our classes C of 
tree transformations will always contain the identity relations. 

An alphabet S is a nonempty and finite set, of which the elements are called symbols. The alpha- 
bet S is ranked if there additionally is a mapping rk : S — s> N that assigns a rank to each symbol. We let 
Sfc = {a G £ rk(<r) = fc} for every k G N. Often the mapping 'rk' is obvious from the context, so we 
typically denote ranked alphabets by S alone. If it is not obvious, then we use the notation crW to indicate 
that the symbol a has rank k. For the rest of this paper, S, A, and T will denote arbitrary ranked alphabets 
if not specified otherwise. 

For every set T, let 

E(T) = {tr(ti,...,t fc ) I a G E fc ,ti,...,t fc G T} . 

Let S be a set with S fl E = 0. The set of T.-trees with leaf labels S is the smallest set £/ such that 

S C {/ and £(£/) C {/. We write Ts for Ts(0), and any subset of Ts is a tree language. The height ht(t) of 
a tree £ G 7s (5) is defined such that ht(s) = for all s G 5 and 

ht(a(ti, . . .,ifc)) = 1 + max {ht(tj) 1 < i < fc} 

for all a G Efc and ti,. . . ,tk G Ts(S'). Given a unary symbol 7 G £1 and a tree t G T^(S), we write 7 fe (£) 
for the tree 7(- • • ^(t) ■ ■ ■). 

k times 

The set pos(£) C N* of positions of t G Ts(S) is inductively defined by pos(s) = {e} for every s G S and 



fe 

pos(<r(ii, . . . ,t fc )) = {e} U |J{«w I to G pos(tj)} 



for every a G E/. and ti,...,tk G T^(S). For words u,u> G N*, we denote the longest common prefix of 
u and if by \cp(v,w). Note that lcp(u, it>) G pos(i) for all v, w G pos(i) because pos(i) is prefix-closed. The 
positions pos(t) are partially ordered by the prefix order ^ on N* [i.e., v ^ w if and only if v = lcp(u, w)\. 
The size \t\ of the tree t G T^(S) is |pos(t) | ; i.e., the number of its positions. Let t G Ts(S') and w G pos(i). 
The /abeZ of £ at w is t(w), and the w-rooted subtree of i is i^. Formally, s(e) = s\ £ — s for every s G S 1 and 

..la if u; = e It ifw = e 

t(w) = < , and i| u , = < 

I if w = it; and i G N I t%\ v if to = iv and iei 

where i = a(ti, ...,£&) with <r G E^ and ti,.. . ,tk G T-z(S). For every selection [ICS of leaf symbols, we let 
pos u (t) = {w G pos(t) | t(w) G C/} and pos s (t) = pos^ ,,}.(£) for every s e S. The tree £ is linear (resp. non- 
deleting) in {/ if |pos u (i)| < 1 (resp. pos u (i)| > 1) for every u G U. Moreover, var(t) = {s G S | pos s (t) / 0}. 
The expression t[u] w denotes the tree that is obtained from t G by replacing the subtree t\ w at w 

by u G T E (S). 

Let U C 5 be finite, t G T S (S'), and 6>: U -t {L \ L C T E (S)}. We define the tree language t6 by 
induction as follows. 

• s9 = s for all s G S\U, 

• s6 = 6(s) for all s eU, and 

• for all a G E^ and ti, . . . , tk G Ts(S) 

a(tx, tk)B = {a(ui, ...,u k )\ui£ ti8, ...,u k G t k 9} . 

For every neN we fix the set X„ — {x\, . . . , x n } of variables, which we assume to be disjoint from all ranked 
alphabets considered in the paper. Given t G Ts(S) and r : X n — > T^(S), we write t\r(xx), . . . , r(x n )] for 
where #(xj) = {r(xi)} for all 1 < i < n. 

A iree homomorphism from E to A is a family of mappings (h k \ k G N) such that ftfc : Efc — > Ta(^Oc) 
for every fc G N. Such a tree homomorphism is 

• linear if for every a G E^ the tree h k {o~) is linear in 

• complete if for every cr G E& the tree hk(o~) is nondeleting in X k , 

• strict if /ife: E^ — > A(T&(Xk)) for every k G N, and 

• delabeling if /i/. : E^ — >• X/. U A(Xfc) for every fc G N. 

We abbreviate the above restrictions by T, 'c', 's', and 'd'. The tree homomorphism (hk \ k G N) induces a 
mapping ft,: T^(S) — > Ta(S') defined inductively by ft(s) = s for all s G S and 

fc(<r(*i, . . .,**)) = h k (o-)[h(h), h(t k )} 

for all a G Efe and t\, . . . ,t k G T^(S). As usual, we also call the induced mapping h a tree homomorphism. 
We denote by H the class of all tree homomorphisms, and for any combination w of T, 'c', 's', and 'd' we 
denote by w-H the class of all ui-tree homomorphisms. For instance, lcs-H is the class of all linear, complete 
and strict tree homomorphisms. 



In the following, we need regular tree languages [1J, Il5l | and basic results for tree automata, which 
recognize the regular tree languages. The set Reg(r) contains all regular tree languages L C Tr over the 
ranked alphabet T. A particular well-known result, which we use, states that tO G Reg(r) for all t G Tp(U) 
and 9: U — > Reg(r). A bimorphism is a triple B = (tp, L, ip) consisting of a regular tree language L G Reg(r), 
an input tree homomorphism ip : Tr — > ■ and an output tree homomorphism tp : Tr Ta . The tree 
transformation B C Ts x Ta computed by the bimorphism B is the relation B — {(t/}(t),ip(t)) | t G L}. 
Given two combinations v and w of restrictions for tree homomorphisms, we let B(u, w) denote the class of 
all tree transformations computed by bimorphisms B — L, ip) such that tp and <p are tree homomorphisms 
of type v and w, respectively. 



3. Linear extended top-down tree transducers 

Our main model is the linear extended top-down tree transducer [l], 0, U3; H3| with regular look-ahead 
(l-xt R ), which is based on the classical linear top-down tree transducer without [22|, and with regular 
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look-ahead [8j. We will present it in a form that is closer to synchronized grammars |5| because we will use 
an auxiliary structure, called the links, in a later proof. In synchronous grammars, equal states in the left 
and right-hand side of a rule are implicitly linked, and equal states are explicitly linked in a derivation, in 
which such linked states will be replaced at the same time by a rule for that state. In a rule of an l-xt R , 
these implicit links form a bijection between a subset of the states in the left-hand side and all states in the 
right-hand side. Thus, some states might exclusively occur in the left-hand side. For those states we can 
use the regular look-ahead to restrict the subtrees that are acceptable at these positions. 

Definition 1 (see (id . Section 2.2]). A linear extended top-down tree transducer with regular look-ahead 
(l-xt R ) is a tuple M = (Q, S, A, I, R, c), where 

• Q is a finite set of states, 

• £ and A are ranked alphabets of input and output symbols, 

• I C Q is a set of initial states, 

• R C Ts(Q) x Q x Ta(Q) is a finite set of rules such that £ and r are linear in Q and var(r) C var(£) 
for every (£, q, r) G R, and 

• c: Q —> Reg(S) assigns regular look-ahead to each state. 

It is worth noting that we assign look-ahead to each state, but in a derivation we will onl y u se the look- 
ahead if the state is deleted (see Definition [S]). Moreover, in contrast to other definitions [IH Hdj. we do not 
allow the same state to occur several times (neither in the left- nor in the right-hand side). However, with 
the help of a simple renaming, each traditional linear extended top-down tree transducer can be written in 
our slightly more restrictive format. Next, we recall some important syntactic properties of our model. To 
this end, let M = (Q, S, A, /, R, c) be an l-xt R in the following. It is 

• a linear extended tree transducer (without look-ahead) [1-xt], if c(q) = for every q S Q, 

• a linear top-down tree transducer with regular look ahead [l-t R ] if £ G £(Q) for every (£,q,r) G R, 

• a linear top-down tree transducer (without look ahead) [1-t] if it is both an 1-xt and an l-t R , 

• e-free if £ ^ Q for every (£, q, r) G R, 

• strict if r ^ Q for every {£, q, r) G R, 

• a delabeling if £ G £(Q) and r G Q U A(Q) for every (£, q, r) G R, 

• nondeleting if var(r) = v&r(£) for every (£, q, r) G R, and 

• a finite-state relabeling [qr] if it is a nondeleting, strict delabeling 1-t such that pos p {£) = pos p (r) for 
every (£, q,r) G R and p G var(r). 

We note that each l-t R and each 1-t is automatically e-free. To simplify the notation in examples and 
illustrations, we sometimes write rules as £ — — > r instead of (£,q,r). Similarly, we write £ r as a 
shorthand for the two rules £ r and £ r. Moreover, for every p G Q and {£, q,r) G R we identify 
poSp(^) and pos p (r) with their unique element if the sets are non-empty. Since £ and r are linear in Q, there 
is indeed at most one element in pos p (^) and pos p (r). Finally, for every q G Q, we let 

R q = {peR\p=(£,q,r)} 

be the subset of R that contains all rules for state q. 

Example 2. Let us consider the l-xt R Mi — (Q, S, A, {*}, R, c) given by 

• Q = {*,P,QS<7',id,id'} ; 

. S = { CT ( 2 ), 7 f ) , 7 ( 1) }UA ; 

. A = {af\af )7 W,a(°)}, 

• R contains exactly the following rules 

vi(p,q) o-i{p,q) 
a(q, id) q 

7 (id) '^4 7 (id) 



cr 2 (id, id') — ^ er 2 (id,id') 
o-(q',q) q 



71 b) — >p 
72(g) <? 



id. id' 

a — > a 
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/\ 

p q 



A 



A 

g' q 



Figure 1: Illustration of two rules with their implicit links. 



• c(q') = {t G T s | pos CT2 (<) ^ 0} and c(/) = T s for all other states f. 
It can easily be checked that c(g') is a regular tree language. The l-xt R M\ is an e-free, nondeleting, delabeling 
top-down tree transducer with regular look-ahead. 

Next, we recall the semantics of the l-xt R M = (Q, £, A, I, R, c), which is (mostly) given by synchronous 
substitution. While the links in a rule are implicit and established due to occurrences of equal states, we 
need an explicit linking structure for our sentential forms. These links will form dependencies that are used 
in a proof later on. To encode the links, we store a relation between positions of the input and output tree. 
Let C = {D | D C N* x N*} be the set of all link structures. First, we define general sentential forms. 
Roughly speaking, a sentential form consists of an input tree and an output tree, in which positions are 
linked. 

Definition 3 (see (ill . Section 3]). An element (£,D, £) G ?s(Q) x C x Ta(Q) is a sentential form (for 
M) if v € pos(£) and w G pos(£) for every (v, w) G D. For a set S of sentential forms we define 

links(5) = {D | (£, D,Q G S for some trees £ and £} . 

Now we formalize the implicit links in a rule p by defining the explicit link structure determined by p. 
This explicit link structure is added to the link structure of a sentential form whenever the rule p is applied 
in the derivation process. 

Definition 4. Let p G R be the rule (£,q,r), and let v, w G N*. The explicit link structure of p for the 
positions v and w is defined as 

links^Go) = {(u.pos p (^),u;.pos p (r)) | p G var(r)} . 

Example 5. Let us compute two explicit link structures. 

15nkBi,2i (ffi (p,«) ^WP,<?)) = {(11, 211), (12, 212)} 
links 1 , 21 (<7(g',g)-^g) = {(12,21)} 

In illustrations we use grayed splines to indicate the links. The rules above and their implicit links [i.e., 
those of links £)E (j))] are displayed in Figure^ 

The derivation process is started with a simple sentential form (q, {(e, e)}, q) consisting of the input and 
output tree q for some initial state q G / and the trivial link relating both roots. The current sentential 
form can evolve in two ways. Either we (nondeterministically) apply a rule (£, q, r) to a pair (v, w) of linked 
occurrences of the state q or we apply the look-ahead. In the former case, such a rule application replaces 
the linked occurrences of q by the left and right-hand side of the rule. The explicit link structure of the rule 
for v and w is added to the current (explicit) link structure to obtain a new sentential form. Since we are 
interested in the dependencies created during derivation, we preserve all links and never remove a link from 
the linking structure. This replacement process can be repeated until no linked occurrences of states remain. 
Thus, we obtain an output tree without states, but the input tree of the sentential form can still contain 
states, which do not take part in an active link (i.e., a link relating two states in the sentential form). Each 
occurrence of such a state q can simply be replaced by any tree of the regular look- ahead tree language c(q), 
where different occurrences of the same state can be replaced by different trees of c(q). 
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Figure 2: Derivation using the 1-XT R Mi of Example [2] 



Definition 6 (see [11, Section 3]). Given two sentential forms and (£',£)',£'), we write 

if 

• there exists a rule p = (£,q,r) £ R and input and output positions (v,w) £ D PI (pos g (£) x pos ? (£)) 
such that 

£' = £[4> C = CM» D' = £Ulinks K ,4p) , 

• or there exists a position u G posg(£) with w £ posg(£) for all (v,w) £ D and there exists t £ c(j£(v)) 
such that 

e = ([t\v c = c = 

A few derivation steps using the l-xt R Mi of Example [2] are illustrated in Figure [2] Next, we define the 
tree transformation computed by an l-xt R . 

Definition 7. The l-xt R M computes the dependencies dep(M), which are given by 

dep(Af) = {(t,D,u) £T^xCxT A \3q£l: (q,{(s,s)},q) (t,D,u)} , 

where =^* M IS the reflexive and transitive closure of =>-m- Moreover, it computes the tree transformation M, 
which is given by M = {(t,u) \ (t,D,u) £ dep(M)}. 

Example 8. Let Mi be the l-xt R of Example^ Then 

(a i (71 (er 2 (a, a)) , 72 (0(7(01) ,a 2 (a,a)))),D,ai (a 2 (a, a), cr 2 (a, a))) £ dep(Mi) 

where 

D = {(e,e), (1,1), (11,1), (111, 11), (112, 12), (2,2), (21,2), (212,2), (2121,21), (2122,22)} , 

7 



which corresponds to the final element in the derivation displayed in Figure [H Roughly speaking, the tree 
transformation computed by Mi accepts only input trees of a certain shape. In such a tree it removes all 
ji-symbols on the left spine between the symbols o~± and 02. In addition, it also removes all symbols to the 
right of o~\ except for those belonging to the left-most subtree rooted by <y% . 

Since every translation (t, u) S M is ultimately created by (at least) one successful derivation, we can 
inspect the links in the derivation process to exhibit the dependencies. Roughly speaking, the links establish 
which parts of the output tree were generated due to a particular part of the input tree. This correspondence 
is called contribution in [10] . 

Definition 9. To allow concise statements, we introduce the following short-hands: 

$ = e-free s = strict d = delabeling 11 = nondeleting . 

We use these abbreviations in conjunction with l-xt R to restrict to devices with the indicated properties. For 
example, dnl-xt stands for "delabeling and nondeleting linear extended top-down tree transducer" (without 
look-ahead). We use the same abbreviations with the stem (i.e., the material behind the hyphen) in capital 
letters for the corresponding classes of computed tree transformations. For instance, dnl-XT stands for the 
class of all tree transformations computable by dnl-xt, and QR denotes the class of all tree transformations 
computable by qr. 

Our device develops the input and output tree from top to bottom, which yields the name top-down 
tree transducer. Similarly, there exists a device called linear extended bottom- up tree transducer (0, which 
creates the trees from the leaves to the root (i.e., bottom- up) 0. We do not recall the device formally here, 
but we use the abbreviations 1-xb and 1-XB together with the usual properties for it. We refer the reader 
to for full details. 

Next, we recall two important known statements for the class 1-XT R . The first statement relates the 
classes 1-XT R and 1-T R to 1-XB and 1-B, respectively. In addition, we demonstrate that look-ahead is 
superfluous in the nondeleting case. We use the brackets '[' and ']' for optional use of the restrictions ^, 'd', 
's', and 'n' that have to be consistently applied. 

Theorem 10 ([HI, Proposition 3.3 and Corollary 4.1]). 

M[s][d][n]l-XT R = [d[ 8 ][d][n]l-XB W[d][n]l-T R = [s][d][n]l-B 

[^|[s][d]nl-XT R = [^][s][d]nl-XT [s][d]nl-T R = [s][d]nl-T 

To illustrate the consistent application of optional restrictions, the following statements are instances of 
the first result of the above theorem: 

1-XT R = 1-XB and ^sl-XT R = ^sl-XB . 

Secondly, we relate the class 1-XT R to 1-T R , which tells us how to emulate linear extended top-down tree 
transducers with regular look-ahead by linear top-down tree transducers with regular look-ahead |8]. 

Theorem 11. 

^[s][d][n]l-XT R = 1CS-BT 1 ; [s][d][n]l-T R [s][d][n]l-XT R = lc-IT 1 ; [s][d][n]l-T R 

Proof. The statements are easily obtained from Theorem ITOl and [12, Lemma 4.1]. 
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4. Four classes that are closed at a finite power 

In this section, we show that the classes jZl-XT R , jd-XT, ^sl-XT R , and ^sl-XT are closed under composi- 
tion at a finite power. We first recall a central result [3| that proves that the (very restricted) class ^snl-XT 
is closed under composition at power 2. Note that [3fl expresses this result in terms of Dimorphisms, and 
'strict' is abbreviated by 'e' there. In fact, ^snl-XT = B(lcs, lcs) by and fl9L Theorem 4]. 

Theorem 12 (0, Theorem 6.2]). For every n > 2, 

j&nl-XT C ^snl-XT 2 = jZsnl-XT" . 

Now we establish our first composition result, which is analogous to the classical composition result for 
linear top-down tree transducers with regular look-ahead [8| . The only difference is that our first transducer 
has extended left-hand sides (i.e., it is an l-xt R instead of just an l-t R ). Since this fact does not affect the 
original composition construction [7|, we can use the original result to obtain our first result. 

Lemma 13 (composition on the right). 

[^[s][d][n]l-XT R ; [s][d][n]l-T R = [^[s][d][n]l-XT R 

Proof. By Theorem \TT\ [^][s][d][n]l-XT R = lc[s]-H _1 ; [s][d][n]l-T R where the option in the left-hand 
side corresponds to the option [s] in lc[s]-H . Thus, we obtain the chain of equalities 

[^[s][d][n]l-XT R ; [s][d][n]l-T R = lc^-H" 1 ; [s][d][n]l-T R ; [s][d][n]l-T R 

= lcfsj-H- 1 ; [s][d][n]l-T R 
= M[s][d][n]l-XT R , 

where in the second step we applied the following well-known composition result for linear top-down tree 
transducers with regular look-ahead [8]H 

[s][d][n]l-T R ; [s][d][n]l-T R = [s][d][n]l-T R . □ 

In the next result we present a decomposition that corresponds to property P of [6|. It demonstrates 
how to simulate an l-xt R by a delabeling l-d R and an ^snl-xt, for which we have the composition closure 
result in Theorem H"2"1 

Lemma 14 (decomposition). 

^[s]l-XT R C [s]dl-T R ; szfenl-XT 

Proof. Let M = (Q, S, A, /, R, c) be an arbitrary e-free l-xt R . Moreover, let to £ N be such that 
to > |var(r)| for every [£, q, r) G R; i.e., the integer to is larger than the maximal number of states in 
the right-hand side of the rules. For every rule p = (£,q,r) £ R and non-state position w £ pos s (£) in its 
left-hand side, let 

usedp(tu) = {i £ N | wi £ pos(£), vsx{l\ wi ) n var(r) ^ 0} 

be the indices of the direct subtrees below w in t, which still contain states that occur in r. We construct a 
delabeling l-xt R 

Mi = (Q 1 ,E,flU{@ i | < i < m},I u Rx,cx) 

such that 

• Qi = {{p, w) | p = (I, q,r) £R,w £ pos(£)}, 

• rk(p) = |used p (e)| for every rule p £ R and rk(@j) = i for every < i < m, 



3 The abbreviation 'd' has a completely different meaning in @|. 
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• h = | q G I,p G Eg}, and 

• for every rule p — (£,q,r) G i? and non-state position w G pos s (£) we construct the following rule 
of-R x : 

{(p,wii) if r G Q 

p((p, wix),...,(p, wi n }) if r (£ Q,w = e 
@ n ((p, wh), ■ • ■ , (ft, wi n )) otherwise, 

where k = rk(£(w)) and {ii, . . . , i„} = used p (w) with ii < ■ ■ ■ < i n , 

• for every rule p — (£,q,r) G i?, state position w G posg(£) and rule p' € Ri( w ) we construct the 
following rule of i?i : 

{p,s) — > {p,e} , 

• ci((p, w)) — (£\w)c for every p — (£, q,r) G R and w G pos(^). 

To obtain the desired l-t R we simply eliminate the e-rules using standard methods. The elimination is 
successful because the e-rules are non-cyclic@ Intuitively speaking, the transducer Mi processes the input 
and deletes subtrees that are not necessary for further processing. Moreover, it marks the positions in the 
input where a rule application would be possible. Finally, it also already executes all erasing rules of M. 
Let m! G N be such that m! > \£\ for all (£, q, r) G R. We construct the 1-xt 

M 2 = (Q, R U {@ z | < % < m}, A, /, R 2 ) 

such that i?2 contains all valid rules (p(t\, . . . ,tk), q,r) with p = (£, q,r) G R of a strict ln-xt with pos^(^) = 
and \ti\ < m' for every 1 < i < k, where k = rk(p). □ 

Example 15. A full example for the construction of Lemma \14\ would be quite lengthy, so let us only 
illustrate the construction of Mi on the example rule p: 

o-(ft, <r(a, q)) a(a, a(q, a)) , 
for which the only non-trivial look-ahead is c(p) — L. For this rule we construct the following rules in M\: 

<7«p,l),(p,2» { H P {{p,2)) (p',e) (p',s) 

*((p,2l), (p,22» ^ @ 1 ({p, 22)) a ^ @ (p",e) (p",e) 

for all rules p' G R p and p" G R q . Moreover, the look-ahead c\ of M\ is such that 

Cl ((p,l)) = L Cl ((p,21}) = {a} . 

Now we are able to prove that the class gl-XT R is closed under composition at the third power. 

Theorem 16. For every n > 1, 

(g'[s]l-XT R )™ C [s]dl-T R ; g-snl-XT 2 C (g'[s]l-XT R ) 3 . 

Proof. The second inclusion is trivial, so we prove the first inclusion by induction over n. The statement 
is a trivial consequence of Lemma HPf! for n = 1. In the induction step, we obtain 

(4s]l-XT R )" +1 = ^[s]l-XT R ; (^[s]l-XT R ) n 

C 4s]l-XT R ; [s]dl-T R ; g'snl-XT 2 



4 Note that due to the e-freeness of M, we have w ^ e in the e-rules of the second item. Since these rules are the only 
constructed e-rules, we cannot chain two e-rules. 
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(p,e) (p,e) =>m x / \ 



p a p 

> M 1 / \ | =>Mi 



(p,l) (p,2) (p,2) t (p,2) (p,2) 



A I /\ I /\ 

@i * /\ ® x * /°\ ® x 

(p,21) (p,22) ( P)22 ) « (p, 22) fa 22 ) a <p", e > ( p « e ) 

Figure 3: Derivation using Mi of Example 1 1 5 1 for some t a L and p" £ Rg. 

= ^[s]l-XT R ; ^snl-XT 2 

C [s]dl-T R ; jzfenl-XT ; ^snl-XT 2 

= [s]dl-T R ; ^snl-XT 2 

where we used the induction hypothesis in the second step, Lemma 1 1 31 in the third step, Lemma 1 141 in the 
fourth step, and finally, Theorem in the last step. □ 

It is known that we can simulate an l-t R (with look-ahead) by a composition of two 1-t (without look- 
ahead). This fact allows us to obtain the following corollary for the closure of the class ^1-XT under 
composition at power four. 

Corollary 17 (without look-ahead). For every n > 1 

^[s]l-XT™ C QR ; [s]dl-T ; ^snl-XT 2 C ^[s]l-XT 4 . 

Proof. The second inclusion is trivial, and for the first inclusion we use [s]dl-T K C QR ; [s]dl-T and 
Theorem HH □ 

Up to now, we have shown that the classes ^1-XT R and ^1-XT are closed under composition at the third 
and fourth power, respectively. In the rest of the section, we will show that the (strict) classes ^sl-XT R and 
^sl-XT are closed under composition already at the second power. We start with a lemma that proves the 
converse of Lemma QJ] in the strict case. 

Lemma 18 (composition on the left). 

lds-H ; ^sl-XT R C ^sl-XT R 

Proof. Let d: Ts — ► Ta be a strict delabeling linear tree homomorphism. Recall that d also defines 
a tree transformation d: T^(Q) — > Ta(Q), which acts as an identity on states; i.e., d(q) — q for every 
q G Q. Moreover, let M = (Q, A, T, I, R, c) be a strict l-xt R and m G N be such that m > \£'\ for every 
£' G d^ x (£) and (£,q,r) G R. Note that the set <i _1 (£) is finite because d is strict. We construct the l-xt R 
M' = (Q U {1, . . . , m}, S, r, /, R', c') such that for every rule (£, q,r) G R we have each valid rule (£', q, r) 
in R' where £' G d^it) and |pos E (f )| = |pos A (£)|. Moreover, c'(q) = d^ 1 (c( g )) for all q G Q and c'{i) = T E 
for every 1 < i < m. Clearly, d~ l (c{q)) is a regular tree language because c(q) is a regular tree language and 
regular tree languages are closed under inverse tree homomorphisms 0, Lemma 1.2]. Finally, we observe 
that M' is also strict since it has the same right-hand sides as M. □ 

Now we state and prove the mentioned result for the class ^sl-XT R . The general approach is the same 
as in Theorem 1161 but we now use Lemma H51 in the main step. 

Theorem 19. For every n > 1 

(^sl-XT R ) Tl C ^snl-XT ; ^sl-XT R C (^sl-XT R ) 2 . 
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Class 


Closed at power 


Proved in 


fl-XT R 


3 


Theorem [TBI 


fl-XT 


4 


Corollary [T71 


^sl-XT R 


2 


Theorem HU 


{/si-XT 


2 


Corollary HOI 



Table 2: Summary of the results of Section HI 



Proof. Again, the second inclusion is trivial. For the first inclusion, the induction basis is also trivial, and 
we prove the induction step as follows 

(^sl-XT R )™ +1 = (^sl-XT R )™ ; ^sl-XT R 

C j&nl-XT ; ^sl-XT R ; /sl-XT R 

= /snl-XT ; /snl-XT ; lds-H ; ^sl-XT R 

C ^snl-XT 2 ; ^sl-XT R 

= jZsnl-XT 2 ; ^snl-XT ; lds-H 

= ^snl-XT 2 ; lds-H 

= ^snl-XT ; ^sl-XT R 

using the induction hypothesis in the second step, Lemma fTS] in the fourth step, and Theorem [l_2] in the 
sixth step. The final step composes the tree homomorphism with the second tree transducer using the result 
of Lemma fTS] □ 

Again, we derive the "without look-ahead"- version of the general result. In contrast to Corollary \17\ the 
power of closedness does not increase by one for strict ^1-xt. 

Corollary 20 (without look-ahead). For every n > 1 

^sl-XT" C jzW-XT ; ^sl-XT C ^sl-XT 2 . 

Proof. The second inclusion is trivial. For the first inclusion, we observe that 

jZsl-XT™ C jZsnl-XT ; ^sl-XT R 

= lcs-H" 1 ; snl-T ; QR ; ^sl-XT 
= lcs-H" 1 ; snl-T ; ^sl-XT 
= ^snl-XT ; jzfel-XT 

using first Theorem Hill and then Theorem ITTI and (a, Theorem 2.6] in the second step. Next, we use the 
composition result [L, Theorem 1] for top-down tree transducers and Theorem [TTJ again. □ 

Table [5] summarizes our results concerning the powers at which the considered classes are closed under 
composition. 

5. Least power of closedness 

In this section, we will determine the least power at which each of the classes ^1-XT R , ^1-XT, ^sl-XT R , 
and ^sl-XT is closed under composition. 
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Theorem 21. For every n > 2 

^sl-XT C ^sl-XT 2 = ^sl-XT" (^sl-XT R ) C (^sl-XT R ) 2 = (^sl-XT R )™. 

PROOF. By [3, Section 3.4], the classes ^sl-XT and ^sl-XT R are not closed under composition at power 1. 
Moreover, by Theorem fUJl and Corollary [50] both classes are closed at power 2. □ 

In the following, we will use the computed dependencies, for which we observe some important properties 
next. For the rest of this section, let M — (Q, E, A, /, R, c) be the considered l-xt R . To simplify the 
development, we disregard the actual input and output trees in the computed dependencies and will consider 
only the set links(M) = links(dep(M)). We say that M computes the linking structures of links(M). 

Definition 22 ([20, Definition 8]). A linking structure D G C is input hierarchical if for all links 

(vi,wi), (v 2 ,w 2 ) E D 

• if V\ -< V2, then w\ -< W2 , and 

• if v\ = V2, then w\ ■< W2 or W2 ^ wi. 

Input hierarchical linking structures have no crossing links, which are links (v\,w\), (v2,w 2 ) £ D such 
that v\ -< V2 and W2 -< ui\. We define output hierarchical using the same conditions as in Definition l22l for the 
output side; i.e., D is output hierarchical if and only if D~ x is input hierarchical. Moreover, D is hierarchical 
if it is both input and output hierarchical. Finally, a set T> C C of linking structures is hierarchical if each 
element D G V is hierarchical. 

We also need a property that enforces the existence of certain links. Roughly speaking, for a set of 
linking structures T> C C there should be an integer that limits the distance between links in each linking 
structure D G V. 

Definition 23 ([20, Definition 10]). A set 2? C £ of link structures has bounded distance in the input 
if there exists an integer k G N such that for every D G V and all (v, w), (vv" , w") G D there exists 
(vv', w') G D with v' ^ v" and \v'\ < k. 

In other words, in a set of link structures with bounded distance k in the input, we know for any of 
its link structures that between any two source-nested links there exists a link such that the distance of 
its source to the smaller link's source is at most k. This yields that the distance to the source of the next 
nested link (if such a link does exist) can be at most k. Note however, that the above property does not 
require a link every k symbols. As before, a set T> C £ of linking structures has bounded distance in the 
output if D^ 1 = {D~ x I D G £>} has bounded distance in the input. Finally, T> has bounded distance if it 
has bounded distance in both the input and the output. 

Lemma 24. For every l-xt R M, the set links(M) is hierarchical with bounded distance. 

Proof. This lemma follows trivially from Definition [5] 

Next we consider the problem whether a tree transformation can be computed by an l-xt R . For this 
we specify certain links that are intuitively clear and necessary between nodes of input-output tree pairs. 
Hereby we obtain the specification sentential forms. Then we consider whether this specification can be 
implemented by an l-xt R . Often we cannot identify the nodes of a link exactly. In such cases, we use splines 
with inverted arrow heads, which indicate that there is a link to some position of the subtree pointed to. 
For example, the splines in Figure @] indicate that a node of t\ (resp. t^) on the left is linked to a node of t\ 
(resp. t<i) on the right. 

Theorem 25. (^1-XT R ) 2 C (^1-XT R ) 3 = (^1-XT R )™ for every n > 3. 



5 This notion is called strictly input hierarchical in \'2(i \. 
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Figure 4: Links with inverted arrow. 




Figure 5: Illustration of the relevant part of the specification used in the proof of Theorem 1251 



Proof. The inclusion is trivial and the equality follows from Theorem [TCI so we only have to prove strict- 
ness. Recall the l-xt R M\ of Example E] In addition, we use the two Dimorphisms B 2 and B$ of @, 
Section II. 2. 2. 3.1], which are in B(lcs,lcs)0 As mentioned, ^snl-XT = B(lcs, lcs), hence B 2 and B$ can also 
be defined by some ^snl-xt M 2 and M3, respectively. For convenience, we present M 2 and M3 explicitly 
before we show that r = Mi ; M 2 ; M3 cannot be computed by a composition of two ^l-xt R . 
Let M 2 = (Q 2 , A, T, {*}, R 2 ) be the ^snl-xt with 

• Q 2 = id,id'} and T = 7 «, a®}, and 

• R 2 is the set of the rules 

ci(*> C2(id, id')) — > cr(a(*, id), id') <72(id,id') — — > <r(id, id') 

1X id, id' 1X id, id 7 

7(id) — > 7 (id) a — > a . 

Moreover, let M 3 = (Q 3 , T, A, {*}, i? 3 ) be the ^snl-xt with 

• Q3 = {*,]?, id, id'}, 



In [i| strictness is denoted by 'e'. 
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• i?3 is the set of the rules 



a(p,id) a 1 (p, id) o-(a(p, id), id') -A <7i(p, <r 2 (id, id')) 7 (id) '-^ 7 (id) 

7(id) 7(id) a '— ^ a . 

We present a proof by contradiction, hence we assume that r = Ni ; A2 for some fl-xt Ni and A^. 
With the help of Lemma [2U we conclude that links (ATi) and links(A2) are hierarchical with bounded 
distance ki and k2, respectively. We consider (t,u) G r such that the left ci-spines of t and u are longer 
than k\ and ^2, respectively^ Moreover, by assumption, there exists an intermediate tree s and linking 
structures D\,Di G C such that (t,Di,s) G dep(ATi) and (s,D2,u) G dep(A^2). We specify some links that, 
due to the last two inclusions, should be necessarily present in D\ and D2 and show that they lead to a 
contradiction (see Figure [5]). 

First, we consider the links D2 generated by A^- Since the left eri-spine in u is longer than /C2 and there 
are links at the root (i.e., (e, e) G D2) and inside t n , there must be a linking point at position w G pos a (u) 
along the left ai-spine with w/e, which links to position v' in the intermediate tree s (i.e., (v',w) G D2). 
Let u\ w — ai (it', (72(ti+i) ti))- We assume that every tj with j < n is high enough (i.e., ht(ij) > fc 2 )- This 
yields that there exist linking points in tj+i, ij, and of u. Let (w.- +1 , w;+i), (u|, lOj), (^_i, u?i_i) G D2 be 
those linking points. Since D2 is hierarchical and itfj+x and Wi are below u> in u, we know that v' i+1 and 
are below v' in s (i.e., u' < v' i+1 ,v' i ), whereas v' ^ Next, we locate U in the input tree t. By the general 

shape of t, the subtree U occurs in a subtree <7i(t', cfo^^i, *i-i)]) f° r some tree c G Te(Xl) with exactly one 
occurrence of x\. Again we assume that c is suitably larg^l, which forces a linking point inside c in addition 
to those in U, and tj_i because (t, s) G A^. Using the same arguments for N\ as before for A^, we now 
locate the links (y,v") G D\ linking into c, which dominates the links (yi,v"), (j/i-i> £ -Di linking to 
and ti—i, respectively. Thus, v" ^ v", v'[_ x . In addition, there is a link (j/i+i, G D\ linking into tj+i, 
which is not dominated by v". Consequently, v" v" +1 . Finally, let 

Vi-i = lcpK_ 1; v"_ x ) Vi = lcp(^, u") v l+1 = lcp(v' i+u v" +1 ) . 

It is easy to see that the relevant links can be chosen such that 

v' ■£ Vi-i v' ^.Vi v' < v i+ i 

v" < v" ^ Vi v" 2< u i+1 . 

Now, we have either lcp(«j_i,Uj) ^ lcp(uj, «i+i) or lcp(i>j, Wj+i) ^ lcp(wj_i, Uj). In the first case we get 

u" ^ lcp(Wi_i,Uj) ^ lcp(«i,W i+ i) ^ , 

and in the second case we get 

v' < lcp(Vi,V i+ l) d: lcp(Vi-l,Vi) ^ Vi-i , 

which are contradictions. □ 

Fortunately, we can use the proof of the previous theorem to conclude that four is the least power at 
which the class ^1-XT is closed under composition. 

Corollary 26. jrt-XT 3 C ^-XT 4 = ^-XT" for every n > 4. 



7 Some additional requirements on t and u are developed in the proof later on. 
8 More precisely, we assume that the only element of pos^ (c) is longer than k\. 
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Table 3: Summary of the results of Sections HI and [51 



Proof. The inclusion is trivial and the equality follows from Corollary |T7J For the strictness, the proof of 
Theorem [23] essentially shows that in the first step we must delete the contexts indicated by triangles (such 
as c) in Figure [5] because otherwise we can apply the method used in the proof to derive a contradiction (it 
relies on the existence of a linking point inside such a context c). Thus, in essence we must first implement a 
variant of the l-xt R Mi of Example [2] It is a simple exercise to show that the deletion of the excess material 
cannot be done by a single 1-xt as it cannot reliably determine the left-most occurrence of o<i without the 
look-ahead. Thus, if we only have 1-xt to achieve the transformation, then we already need a composition 
of two 1-xt to perform the required deletion of the contexts, which proves the main statement. □ 

In Table [3] we summarize the results of this and the previous section, which allow us to present the 
least power at which the closure of the considered composition hierarchies is achieved. For the sake of 
completeness, we also present the corresponding results for the classes ^snl-XT and B(l, 1) that were obtained 
in [3j, |6|. Recall that B(l, 1) is the class of all tree transformations computable by bimorphisms, in which 
both tree homomorphisms are linear. 



6. Infinite composition hierarchies 

To complete the picture, we will need one further result showing the infiniteness of the composition 
hierarchy for a large number of classes. In order to obtain a result that is as general as possible, we use 
bimorphisms Q instead of l-xt R in this section. We conclude several results for various tree transducer 
classes from the result for bimorphisms. 

The main auxiliary notion used in the proof of the infiniteness of the composition hierarchy is a notion 
assigning levels to positions in a tree. Let t € Ts and I £ N be an arbitrary integer. Since branching 
positions (i.e., those that are labeled by symbols of rank at least 2) will play an essential role, we define the 
set of all branching positions of t together with two different successors and the branching positions along 
a given path. For every w,ww" G pos(t), let 

br f = {(w,i,j) | w e pos(t), t(w) <£ E USi, 1 < i,j < vk(t(w)), i ^ j} 
br t (w,w") = {ww' | w ^ w" and (ww',i,j) S br t for some i,j} . 

We inductively define the sets SP„(t) C pos(t) x N x N and P„(t) C pos(f) for every t £ Ty, and n G N as 
follows: 

SP^) = br t and P e (t) = {w \ («,,», j) £ SP$(*)} 

SPl+iW ={(w,i,j) ebT t | 3w x €Pl(*U): |br t (ii»,tUi)nPl(t)| >£ n+ \ 

3w 2 e Pl(tU-): \br t (wj ) w 2 )nP e n (t)\ > i n+1 } 
Pn+iW = & I (w,i,j) eSPl +1 (t)} . 
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Figure 6: Tree used in Example 1271 



Intuitively, all branching positions have level (for any distance £) and a branching position w has level n+1 
if there are 2 paths in different direct subtrees below w that both have at least £ n+1 branching positions of 
level n along the path. Clearly, SP* +1 (*) C SP* n (t) and P* +1 (i) C P^(i) for all n G N and t G T s . 

Example 27. Let i be tfte tree depicted in Figure® Then 

Pl(t) = {e, 1, 11, 112, 1121, 11211, 12, 121, 2, 21, 211, 2111} = br f 

P?(t) = {M} 
Pl(t) = . 

Lemma 28. Let (p: Tr — > Ta be a complete tree homomorphism. Moreover, let i e Tr and ^, n G N. If 
(e,«,j) G SP*(i), then there exists (v,i',f) G SP^(^(t)) such that 

• « G pos(<p(*(e))), 

• ui' ^ vi for some «i G pos x . (ip(t(e))), and 

• «i' d ^2 for some w 2 G pos^. (y>(t(e))). 

Proof. Let t = 7(^1, . . . , ifc) with 7 G and ti, . .. G Tr, and let u = f{t). We prove the statement 
by induction on n. In the induction base, we have n = and (e,i,j) G SPq(£). We arbitrarily select 
v\ G pos^.. (93(7)) and «2 G pos x .(y?(7)), which are positions of the variables Xi and Xj in (^(7), respectively. 
Since y> is complete, such positions exist. Let v = lcp(«i, 1)2) be the longest common prefix, and let v\ = vi'v[ 
and v 2 — vj'v' 2 . Clearly, v G pos(<y9(7)) and (v,i',j') G br^m because i»i and v 2 are in the distinct direct 
subtrees i 1 and j' below v. Trivially, vi' ^ v\ and uj' ^ «2; which completes the induction base. 

In the induction step, let (e,i,j) G SP n+1 (i) and w\ G P n (ii) and w 2 G P„(ij) be the required positions 
of level n such that 

\bx t {i,wi) nP£(t)| > T+ 1 and |br t (j,«*)nP£(t)| > T+ 1 . 

We observe that 

u = yj(i) = ip{i)[ip{h), ... ,¥>(**;)] • 

Now, we follow a similar approach as in the induction base. Since (p is complete, the tree u contains the 
subtrees (p(ti) and ip(tj) at any of the positions v\ G pos Xi (<^(7)) and v 2 G pos x .(y>(7)), respectively. As 
before, we let v — lcp(t>i,V2) G pos(<p(7)) be the longest common prefix, and let v% — vi'v[ and v 2 — vj'v' 2 . 
Clearly, (v,i',f) G br u is a branching position. It remains to show that (v,i',j') G SP„ +1 (it) because the 
remaining conditions of the lemma are already fulfilled. 

Due to the fact that w\ G P„(£j), we have (s,i",j") G SP e n (U\ Wl ) for some G N. Moreover, since 

ip is complete, the subtree ip(ti\ Wl ) occurs in u\ Vl . By the induction hypothesis, there exist v'{ G P n (ip(ti\ Wl )). 
Moreover, v\v'v'{ G P„(w) for some v' G pos(u|„ 1 ) such that u\ v i = (p(U\ Wl ). We will only verify the condition 

\hv u (vi',v' 1 v'v'{)r\Pi(u)\>£ n+1 (1) 
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because the proof for the second path works analogously. 

Let w[ G br t (i, wi) flP^(i) be any position of level n along the path from i to iw\ such that w[ -< iwi . Let 
i'" G N be the unique integer such that w'^i'" -< iw%. Since w' 1 , iw± G P e n (t) we can conclude that there exist 
G N such that i'" ^ j" and that (w[,i",j") G SP£(t). Then (w[,i'",j") G SP£(t) because Wi G P£(i). 
In other words, (e,i'",j") G SP„(t| w j). Since <^ is complete, the translation tpit^) occurs in ip(ti). By the 

induction hypothesis, there exists (v'",ii,jx) G SP^(<^(i|„,j )), which yields (v"v'",ii,ji) G SP^(<^(ii)) for 
some position v" G pos(ip(ti)) with = <^(t„,j), such that v'v"v"'ix ^ uj. Clearly, 

VlV ' v " v "' e br„(ui / , wJw'O n P£(u) . 

Moreover, the two conditions that i/" G poa((p(t(w[))) and that is complete guarantee that for each 
selection of w' x we obtain a different position v\v'v"v'" G br u (vi' jV^v'v") RP„(tt). This verifies ([T]), because 
there are at least £ n+1 possible selections of w'±. □ 

Lemma 29. Let ip : Tr — > T% be a linear tree homomorphism. Moreover, let t G Tp and £, n G N be such 
that i > ht(V>(7')) for all symbols 7' G T. If («, f) G SP^ +1 (V>(*)) with v G pos(V>(t(£))), then there exists 
(e,i,j) G SP^(t) such that 

• vi' < vi for some G pos Xi (ip(t(e))) and 

• vj' ^ W2 for some w 2 G pos^^. (-0(t(e))). 

Proof. Let t = 7(^1, . . . , tk) with 7 G Tfc and t±, . . . ,tk G Tr, and let it = ip(t) be its image. Moreover, let 
(v,i',j') G SP^ +1 (u) with v G pos(V>(7))- By definition, there exist long paths in u starting at vi' and vj', 
which are longer than £, which in turn is longer than any path in ip(l') f° r every 7' G T. Consequently, 
there exist 1 < i,j ' < h and 

Vl G POS Xi (^(7)) and v 2 G pos X3 (^(7)) 

such that vi' ^ v\ and vj' ^ vi. Since if) is linear, we know that i ^ j. It remains to prove that 
(e,i,j) G SPf t (i) as the additional properties of the lemma are already satisfied. We prove this remaining 
property by induction on n, and the induction base is already proven as (s,i,j) G br t . 

We proceed with the induction step. By definition, there exist positions v[ G P^(it|t,i') and v' 2 G P n (u\ v ji) 
such that 

|br„(w>i) nP£(«)| > r +1 and |br u (uj',^) n P^(u)| > r +1 . 

Let w\ G pos(^i) be the position that creates the symbol u(vi'v[). We only prove the required property for 
the path br t (i,wi). 

Since w\ creates the symbol u(vi'v[), there exists a position v" G pos(it) such that vi' ^ v" ^ vi'v[ 
and u\ v n = ip(t\ Wl ). Moreover, by v[ G P £ n (u\ V i>) there exists (v'{,ii,j'x) G SP^(ii|„») with vi'v[ = v"v", 
which yields that v" G pos(V>(i(u>i))). With the help of the induction hypothesis we conclude the existence 
of (e, i\,j\) G SPf i (t| UJl ), and thus wi G P e n (t). Finally, for every v' G bi u (vi' , v[) nP^(w). Let w' G pos(^) 
with i < w' < wi be the position that creates the symbol u(v'), and let v" G pos(w) be the position such 
that vi' ^ v" ^ v' and u\ v » — ip(t\ w >). Since v' G P„(u) there exists (-y", , G SP^u^") with v' — v"v", 
which yields that v" G pos(ip(t(w'))). Moreover, let i'[ G N be the unique integer such that v'i'[ < vi'v[. 
Consequently, (u", i", j[) G SP^(u|„") because vi'v[ G P l n (u). By the induction hypothesis, there exists 
(e,i\, ji) G SP e n (t\ w >) such that 

• v"i" ^ v'i for some v'{' G pos^..^ (ip(t(w'))) and 

• Uiil ^ v'i' for some < G pos^ (ip(t(w'))). 

Moreso, we have w' G br t (i,wi) PI Pfj(i), and ^(^1^,'^) = for some position vi' ^ u'" ^ vi'v[, which 
determines another position of bv u (vi' ,v[). Since at most £ positions of bi u (vi' ,v[) can be created by a 
single symbol of t and |br u (vi, -i^) [~1 P^(w)| > - £+1 = {£" - 1 + Consequently, we obtain at 

least £ n positions in br t (i, w\) PI P^(i), which completes the induction step and the proof. □ 

Next, we combine the previous two lemmas into the main statement that will be used to prove the infinity 
of several composition hierarchies. In essence, we show that a bimorphism in B(l, c) can reduce the level by 
at most 1 (while keeping the distance £). 
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Figure 7: Illustration of the 3-fold composition of the tree transformation computed by the ^hl-xt M of Example 1321 



Theorem 30. Let B = (ip,L,(p) be a bimorphism such that ip: Tp — > Is is linear and </?: Tr — > Ta is 
complete. Moreover, let (i, u) 6 £?, and let ^ G N be such that I > ht(V>(7)) f° r every 7 e T. If P^ +1 (i) / 0, 
then p£(u) ^0. 

Proof. Since (t, u) € i3, there exists s£i such that tp(s) — t and 93(a) = u. By assumption, we have that 
Pf l+1 (?/>(s)) 7^ 0, which by Lemma 1291 yields that P„(s) 7^ 0. Using the completeness of ip and Lemma 1251 
we obtain that V l n (u) ^ as desired. □ 

Now we can simply chain Theorem[30]to show that an n-fold composition of tree transformations of B(l, c) 
can decrease the level by at most n (for a suitable distance £). 

Corollary 31 (of Theorem I30|) . Let n > 1, and for every 1 < i < n, let B. L = (ipi, Li, (pi) be a bimorphism 
such that -0i is linear and ipi is complete. Moreover, let ipt : Tp i — > TAi and : 2r i+1 — > Ta ; for every 
1 < i < n. Finally, let I € N be such that I > ht(^i(7)) f° r every 1 < i < n and 7 e T,, and let 
(t,u) € Si ; • • • ; B„. If Pf i+1 (<) # 0, then (u) ^ 0- 

It remains to demonstrate a tree transformation that can be computed by (n+1) ^nl-xt and that reduces 
the level of positions from n + 1 to 0. Clearly, this tree transformation cannot be computed by an n-fold 
composition of tree transformations from B(l, c) because the output tree should contain a position of level 1 
by Corollary [22 We make sure that the assumptions of Corollary I3"T1 are satisfied. 

Example 32. Let M = (Q, E, S, {*}, R) be the $nl-xt such that 
• <9 = and £ = {cr( 2 ),a^}, and 



19 



• R contains exactly the following rules 



a(*, a) —4 * a(-k, q) —4 cr(-k, q) a — H> a 

The 3-fold composition M ; M ; M can compute the transformation indicated in Figure^ 

We use the tree transformation computed by the ^hl-xt M of Example 1321 and show that n transforma- 
tions from B(l,c) cannot compute the tree transformation M n+1 . 

Lemma 33. For every n > 1, 

ghl-XT n+1 % B(l,c)" . 

Proof. Let Lq = {a}. For every i > 1, let Ci and Li be the smallest sets such that 

• o~(xi,t) G Ci for every t G Li-\ and c[c'] G Ci for all c, c' £ C and 

• Li — {c[a\ | c £ C*,;}. 

It is easy to see that n + 1 compositions of the tree transformation M computed by the ghl-xt M of 
Example 1321 can relate L n+ 2 and L\ (see Figure [7]); more precisely, for every t G L n+ 2 there exists u G L\ 
such that (t, u) G M n+1 . With the help of Corollary [3T] we can complete the proof if P„ +1 (£) =/= for some 
t G L n+ 2 and suitably large I G N. 

Let £ G N be fixed. Thus, we now prove that for every n G N there exists a tree t G i n +i such that 
P„(t) 7^ by induction on n. In fact, we prove the stronger statement that there exists t G L n+ i and 
w G pos(i) such that |br t (e, w) C\¥ l n (t)\ > £ n+1 . For n = 0, we select the tree c e [a] G £i, where c = cr(xi, a), 
and the position w = Since Po(i) = br t (e, u>), the selection of i and w fulfills the requirements. In 

the induction step, there exist a tree t G L n +i an( l u> G pos(t) such that |br t (e, w) n P„(t)| > ^ n+1 . We 
consider the tree t' = c e [c[a]] with c = cr(a;i,i) and the position w' — X 1 _1 . Obviously, G i n +2 and 
w" G P n+1 (i') for every w" -< w' (via the paths 12u> and 2u>), which completes our induction and proof. □ 

Now we are able to prove that the composition hierarchy of ^nl-XT and several other classes is infinite. 

Theorem 34. For every n > 1, 

B(Lc) n C B(l,c)" +1 
^nl-XT™ C ^nl-XT n+1 

Proof. First, we note that all inclusions are trivial 
Theorem 17] we have B(lcs,lc) = ^hl-XT, hence 

jzM-XT" C B(l,c)" and /nl-XT n+1 C B(l,c) n+1 . 

These facts together with Lemma [33] imply the strictness of the two inclusions on the left. Moreover, 

■•• C ([s][n]l-XT R )" C [s][n]l-XT" +1 C ([s][n]l-XT R ) n+1 C [s][n]l-XT n+2 C ••• 

for all n > 1, which shows that the composition hierarchy of [s] [n]l-XT is infinite if and only if the composition 
hierarchy of [s][n]l-XT R is infinite. Thus, it remains to show the latter property. 

Using simple symmetry, we observe that snl-XT = ^nl-XT -1 , which together with the symmetric version 
of Lemma [33J yields snl-XT" +1 % B(c,l)". Furthermore, B(lc,l) = 1-XT R by [19, Theorem 4], so naturally, 

([s][n]l-XT R ) n C B(c,l) n and snl-XT" +1 C ([s][n]l-XT R ) n+1 

for all n>l. Taken together with snl-XT" +1 % B(c,l)™ we obtain the desired strictness. □ 

Table 2] summarizes our results of this section. For the sake of completeness, we mention some additional 
results from the literature, where T stands for the class of all tree transformations computable by top-down 
tree transducers [7], and ^-XT stands for the class of tree transformations computable bye-free extended 
top-down tree transducers [la ]. The mentioned result g'-XT C T 2 can be concluded from (m Theorem 4.8]. 



]s][n]l-XT" C [s][n]l-XT" +1 
([s][n]l-XT R )" C (]s][n]l-XT R ) ,l+1 . 

. Thus, we only need to argue the strictness. By [lj| 
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Class with infinite composition hierarchy 


Proved in 


B(l,c), jzln-XT, [s][n]l-XT, [s][n]l-XT R 


Theorem |M] 


T 

^-XT 
B(c,c) 


[9, Theorem 3.14] 
^-XT C T 2 and (T n | n > 1) is inhnite 
[1] and [6, Section II-2-2-3-4] 



Table 4: Summary of the results of Section l6l 
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